RAW SEQUENCE LISTING 
ERROR REPORT 



BIOTECHNOLOGY 
SYSTEMS 
BRANCH 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Date Processed by STIC: 1/ ( J Zoo 



Application Serial Number: 
Source: f /t o<=> ' 




THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-41 19 (R. Wax) ^ 
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.1 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/ofrices/pac/checker 

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 2023 1 zip code address for the 

United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.uspto,gov/ebc/efs/downloads/documeats.htm> , EFS Submission 
User Manual - ePAVE) 

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7 th Floor, Examiner Name, 
Sequence Information, Crystal Mall One, 1911 South Clark Street, Arlington, VA 22202 



U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two, 
201 1 South Clark Place, Arlington, VA 22202 
4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office, 
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 201 1 South Clark Place, Arlington, VA 22202 



Or 



Revised 01/29/2002 



Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION 



SERIAL NUMBER: 



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 



- 1 Wrapped Nucleics 

Wrapped Aminos 



The number/text at the end of each line '"wrapped" down to the next line. This may occur if your file 
was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 
prevent "wrapping." 



Jnvalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 



3 Misaligned Amino 

Numbering 

4 Non-ASCII 



_ Variable Length 



Patentln 2.0 
" "bug" 



The numbering under each 5 th amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead. 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission is saved in ASCII text 

Sequences) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequences^) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 



7 Skipped Sequences Sequencers) 

(OLD RULES) 



missing. If intentional, please insert the following lines for each skipped sequence: 



(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) . SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 
(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 



8 Skipped Sequences Sequence(s) 



(NEW RULES) 



missing. If intentional, please insert the following lines for each skipped sequence. 



<210> sequence id number 
<400> sequence id number 
000 



10 



11 



12 



_Use of n'sor Xaa*s 
(NEW RULES) 



Jnvalid <213> 
Response 



Use of<220> 



_PatentIn 2.0 
"bug" 



Use of n*s and/or Xaa*s have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <21 3> response is Unknown or 
is Artificial Sequence 



Sequencers) _ 



missing the <220> "Feature" and associated numeric identifiers and responses. 



Use of <220> to <223> is MANDATORY if <21 3> "Organism" response is "Artificial Sequence" or 

"Unknown." Please explain source of genetic material in <220> to <223> section. 

(See federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager** or any other manual means to copy file to Jloppy disk. 
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1600 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/586 , 744B 



DATE: 02/01/2002 
TIME: 08:56:52 



C--> 



3 
5 
7 
9 
10 
12 
14 
16 
17 
18 
19 
21 
22 
24 
26 
27 
29 
30 
32 
33 
35 
36 
38 
39 
41 
42 
44 
45 
47 
48 
50 
51 
53 
54 
56 
57 
59 
60 
62 
63 
65 
66 
68 



Input Set : A:\9584-017.txt . 
Output Set: N:\CRF3\02012002\I586744B.raw |)0©S Not GOfTipiy 

Corrected Diskette 



<110> APPLICANT: Harrington, et al 
<120> TITLE OF INVENTION: Mammalian Flap Specif ic-Endonuclease 
<130> FILE REFERENCE: 9584-017 

<140> CURRENT APPLICATION NUMBER: 09/586 , 744B 
<141> CURRENT FILING DATE: 2000-06-02 
<160> NUMBER OF SEQ ID NOS : 74 
<170> SOFTWARE: Patentln version 3.0 
<210> SEQ ID NO: 1 
<211> LENGTH: 380 
<212> TYPE: PRT 
<213> ORGANISMCArtificial 
<22 0> FEATURE 



The type of errors shown exist throughout 
the Sequence Listing. Please check subsequent 
sequences for similar errors. 




<223> OTHER INFORMATION 

<400> SEQUENCE: 1 V ^ ^ rrj^T J'u.^^e^y fA&^f\ 

Met Gly lie Gin Gly Leu Ala Lys Leu lie Ala Asp Val Ala Pro *5er 

15 10 15 

Ala lie Arg Glu Asn Asp lie Lys Ser Tyr Phe Gly Arg Lys Val Ala 

20 25 30 

lie Asp Ala Ser Met Ser lie Tyr Gin Phe Leu lie Ala Val Arg Gin 

35 40 45 

Gly Gly Asp Val Leu Gin Asn Glu Glu Gly Glu Thr Thr Ser His Leu 

50 55 60 

Met Gly Met Phe Tyr Arg Thr lie Arg Met Met Glu Asn Gly lie Lys 
65 70 75 80 

Pro Val Tyr Val Phe Asp Gly Lys Pro Pro Gin Leu Lys Ser Gly Glu 

85 90 95 

Leu Ala Lys Arg Ser Glu Arg Arg Ala Glu Ala Glu Lys Gin Leu Gin 

100 105 110 

Gin Ala Gin Ala Ala Gly Ala Glu Gly Glu Val Glu Lys Phe Thr Lys 

115 120 125 

Arg Leu Val Lys Val Thr Lys Gin His Asn Asp Glu Cys Lys His Leu 

130 135 140 

Leu Ser Leu Met Gly lie Pro Tyr Leu Asp Ala Pro Ser Glu Ala Glu 
145 150 155 160 

Ala Ser Cys Ala Ala Leu Val Lys Ala Gly Lys Val Tyr Ala Ala Ala 

■ 165 170 175 

Thr Glu Asp Met Asp Cys Leu Thr Phe Gly Ser Pro Val Leu Met Arg 

180 185 190 

His Leu Thr Ala Ser Glu Ala Lys Lys Leu Pro lie Gin Glu Phe His 

195 200 205 

Leu Ser Arg lie Leu. Gin Glu Leu Gly Leu Asn Gin Glu Gin Phe Val 

210 215 220 

Asp Leu Cys lie Leu Leu Gly Ser Asp Tyr Cys Glu Ser lie Arg Gly 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/5 86 , 744B 



DATE: 02/01/2002 
TIME: 08:56:52 



Input Set : A:\9584-017.txt 

Output Set: N:\CRF3\02012002\I586744B.raw 



69 


22 5 










230 










235 










240 


71 


Tip 

X X ~ 


Gly 


Pro 


Lys 


Arg 


Ala 


Val 


Asp 


Leu 


He 


Gin 


Lys 


His 


Lys 


Ser 


He 


72 










245 










250 










255 




7 A 


(Z 1 11 

oi. u. 


Glu 


He 


Val 


Arg 


Arg 


Leu 


Asp 


Pro 


As n 


Lys 


x y x 


Pro 


Val 


Pro 


Glu 


75 








260 










265 










270 






77 


ASH 


±J - P 


Leu 


His 


Lys 


Glu 


Ala 


His 


Gin 


Leu 


Phe 


Leu 


Glu 


Pro 


Glu 


Val 


78 






275 










280 










285 








ft n 


Leu 


ASp 


tr X 


r* I ii 


C /=i >~ 

OCX 


Val 


Ul u 


Leu 


Lys 




Ser 


Glu 


Pro 


As n 


Glu 


Glu 


ft 1 




290 










Zt ZJ -> 










300 










ft ^ 


Glu 


Leu 


He 


Lys 


Phe 






C\ v 
v3 x _y 


Glu 




Gin 


Phe 


Ser 


Glu 


Glu 


Arg 


84 


305 










310 










315 










320 


86 


He 


Arg 


Ser 


Gly 


Val 


Lys 


Arg 


Leu 


Ser 


Lys 


Ser 


Arg 


Gin 


Gly 


Ser 


Thr 


87 










325 










330 










335 




89 


Gin 


Gly 


Arg 


Leu 


Asp 


Asp 


Phe 


Phe 


Lys 


Val 


Thr 


Gly 


Ser 


Leu 


Ser 


Ser 


90 








340 










345 










350 






92 


Ala 


Lys 


Arg 


Lys 


Glu 


Pro 


Glu 


Pro 


Lys 


Gly 


Ser 


Thr 


Lys 


Lys 


Lys 


Ala 


93 






355 










360 










365 








95 


Lys 


Thr 


Gly 


Ala 


Ala 


Gly 


Lys 


Phe 


Lys 


Arg 


Gly 


Lys 










96 




370 










375 










380 










98 


<210> SEQ ID NO: 


: 2 
























99 


<211> LENGTH: 1144 




















r 




100 <212> TYPE: 


DNA 


















, i 








cDNA 



101 <213> ORGANISM: Artificial 

103 <220> FEATURE: 

104 <223> OTHER INFORMATION: 

106 <400> SEQUENCE: 2 

107 atgggaattc aaggcctggc caaactaatt gctgatgtgg cccccagtgc catccgggag 60 
109 aatgacatca agagctactt tggccgtaag gtggccattg atgcctctat gagcatttat 120 
111 cagttcctga ttgctgttcg ccagggtggg gatgtgctgc agaatgagga gggtgagacc 180 
113 accagccacc tgatgggcat gttctaccgc accattcgca tgatggagaa cggcatcaag 240 
115 cccgtgtatg tctttgatgg caagccgcca cagctcaagt caggcgagct ggccaaacgc 300 
117 agtgagcggc gggctgaggc agagaagcag ctgcagcagg ctcaggctgc tggggccgag 360 
119 caggaggtgg aaaaattcac taagcggctg gtgaaggtca ctaagcagca caatgatgag 420 
121 tgcaaacatc tgctgagcct catgggcatc ccttatcttg atgcacccag tgaggcagag 4 80 
12 3 gccagctgtg ctgccctggt gaaggctggc aaagtctatg ctgcggctac cgaggacatg 540 
125 gactgcctca ccttcggcag ccctgtgcta atgcgacacc tgactgccag tgaagccaaa 600 
127 aagctgccaa tccaggaatt ccacctgagc cggattctgc aggagctggg cctgaaccag 660 
129 gaacagtttg tggatctgtg catcctgcta ggcagtgact actgtgagag tatccggggt 720 
131 attgggccca agcgggctgt ggacctcatc cagaagcaca agagcatcga ggagatcgtg 780 
133 cggcgacttg accccaacaa gtaccctgtg ccagaaaatt ggctccacaa ggaggctcac 840 
135 cagctcttct tggaacctga ggtgctggac ccagagtctg tggagctgaa gtggagcgag 900 
137 ccaaatgaag aagagctgat caagttcatg tgtggtgaaa agcagttctc tgaggagcga 960 
139 atccgcagtg gggtcaagag gctgagtaag agccgccaag gcagcaccca gggccgcctg 1020 
141 gatgatttct tcaaggtgac cggctcactc tcttcagcta agcgcaagga gccagaaccc 1080 
14 3 aagggatcca ctaagaagaa ggcaaagact ggggcagcag ggaagtttaa aaggggaaaa 1140 
145 taaa 1144 

148 <210> SEQ ID NO: 3 

149 <211> LENGTH: 377 

150 <212> TYPE: PRT 
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RAW SEQUENCE LISTING DATE: 02/01/2002 

PATENT APPLICATION: US/09/586 , 744B TIME: 08:56:52 



Input Set : A:\9584-017.txt 

Output Set: N:\CRF3\02012002\I586744B.raw 



S 1 £ 1 


<213> ORGANISM: 


Artificial 




















1 R*l 

X J J 

1 RA 
1 


<22 0> FEATURE: 

<223> OTHER INFORMATION; 


: Fepti^ 








1 
I 




1 Rfi 
X DO 


<400> SEQUENCE: 


J 
























1 CO 

1 JO 


KcL 


vjX y 


lie 


nib 




Xie U 


r\ J- Ci 


J-*y s 


lieu 


Tl *=> 
xxe 


AT 3 
x a 


ASp 


v^» ~\ 

v a x 


Ala 
t\ x a 


Pro 


Ser 


1 

1 D .7 


X 








R 










1 0 

X \J 










1 R 

X ~J 




1 D 1 


Ala 
Ala 


Tie 

xxe 


Arg 


ulU 


Asri 


ASp 


Tl ^ 
lie 


Xjy s 


Q O T~ 
DCi 


xyr 


IT 11C 


vjx _y 


Arg 


iiy s 


Va 1 

v a i 


Ala 

Axa 


ICO 
1 O Z 








9 n 










9 R 
















1 C /I 

1 b 4 


xxe 


Asp 


Ala 


Ser 


lie L. 


Ser 


lie 


Tvt* 

- L y x 


Gin 


Phe 


Leu 


lie 


Ala 


Val 


Arg 


win 


ICR 






^ R 










A 0 










A R 

He -D 








jl b / 


LrXy 


biy 


Asp 


v ai 


lieu 




As n 


CZ~\ n 


Glu 


Gly 


Glu 


Thr 


Thr 


Ser 


Leu 


Upf 

ric u 


1 CQ 

xoo 














R R 




















1 / U 






irne 


Tyr 


Arg 


X 111 


xxe 


Arg 


Met- 
ric u 


VjIU 


As n 


uiy 


Tip 

lie 


Lys 


Pro 


Va 1 

vai 


i 71 


c r 
b d 




















7 R 










ft 0 


X / j 


Tyr 


Val 


irne 


Asp 




Lys 


XT J- 


xrl vJ 


n n 

V7 1 11 


lieu 


Lys 


oex 


r 1 v 
oiy 




lieu 


Ala 


1 7 A 
X / 4 










ft R 










90 










9 R 




1 7C 

x / b 


Lys 


Arg 


Ser 


Pin 
ulU 


Arg 


Ary 


Ala 




Ala 


Glu 


Lys 


Gin 


Leu 


Gin 


r* 1 n 
ii 


Al a 
nla 


17 7 
1 / / 








i no 










i n r 










1 1 n 

XXV 






1 7 Q 

x / y 


Pin 

bin 


ul U 


Al a 


uiy 




r; 1 n 


Glu 


V* 1 
v a x 


Glu 


Lys 


Phe 


Thr 


Lys 


Arg 


Leu 


v= 1 

V a X 


i ftn 

X o u 






1 1 q 
_L X _> 










1 z. \J 










1 9 R 

X ^ -J 








1 « o 


Lys 


Val 


Thr 


Lys 


Ul 11 


Ui c 

nio 


A o n 
noil 


A ct*i 


Glu 


Cys 


Lys 


His 


Leu 


Leu 


JCI 


Leu 


1 ft "3 




1 Ju 










1 ^ R 

X J J 










1 4. 0 

X *t V/ 










1 « R 
X O O 




biy 


xxe 


■ 

Pro 


r P\7T~ 

xyr 


Leu 




nla 


It J. (J 


OCX 


f^l ii 


Al a 
r\X a 




Ala 


oex 




1 ft£ 
X O D 


1 A R 
X 4 D 










1 RO 
l ju 










1 RR 










X D W 


1 ft ft 
XOO 


Ala 


a l a 

Ala 


Leu 


Ala 


Lys 


Aia 


kjX y 


Lys 


V=. 1 


xyr 


M.X a 


AT a 
M. X Ct 


AT s 
r\x a 


1 1 IX 


Pin 

ulU 


Asp 


1 ft Q 










i r 

ID J 










1 70 
X / u 










1 7 R 

X / -J 




1 Q 1 
X y X 




Asp 


Cy s 


Leu 


-L 111. 


lr lit: 


uiy 


ocl 


IT J. W 


v a x 


LlCU 


11" l_ 


A TfT 

r\x y 


Hi Q 

nx o 


Leu 


i ni 


1 Q O 

x y z 








X O \J 










1 ft R 

X O J 










1 90 






1 QA 

X _? fr 


Ala 


Ser 


oiu 


AXa 




Lys 


Xie LI 


IT J. <w» 


Tip 


Vj X 11 


m n 

yj X U. 


c i ic 


XT. X O 


T.f^i i 
xie u 


JCI 


Arg 


x y o 






j. y *j 










9 on 










9 0 R 








* 1 Q 7 

x y / 


Va 1 
Vdl 


Leu 


ijin 


bill 


Leu 


Gly 


Leu 


As n 


Gin 


Glu 


Gin 


Phe 


Val 


Asp 


L6U 


v-.y o 


1 Qft 

x _? o 




210 










91 R 










22 0 












Tip 
lie 


Leu 


Leu 


f "1 T / 

wljf 


Ser 


Asp 


Tvt 

i y j - 


Cys 


Glu 


Ser 


lie 


Arg 


Gly 


lie 




Ala 

M.X a 


9 0 1 


225 










Tin 
z ,? u 










9 ? R 
z> ~j ~j 










24 0 


9 m 

Z U j 


Lys 


Arg 


Ala 


Val 


Asp 


Leu 


lie 


Gin 


Lys 


His 


Lys 


Ser 


lie 


Glu 


Pin 
ulU 


Tl ^ 
xxe 


9 04 










245 










250 










2 55 




9 0 


Val 


Arg 


Arg 


Leu 


Asp 


Pro 


Ser 


Lys 


Tyr 


Pro 


Val 


Pro 


Glu 


Asn 


irp 


lieu 


207 








260 










265 










270 






209 


His 


Lys 


Glu 


Ala 


Gin 


Gin 


Leu 


Phe 


Leu 


Glu 


Pro 


Glu 


Val 


Val 


Asp 


Pro 


210 






275 










280 










285 








212 


Glu 


Ser 


Val 


Glu 


Leu 


Lys 


Trp 


Ser 


Glu 


Pro 


Asn 


Glu 


Glu 


Glu 


Leu 


Val 


213 




290 










295 










300 










215 


Lys 


Phe 


Met 


Cys 


Gly 


Glu 


Lys 


Gin 


Phe 


Ser 


Glu 


Glu 


Arg 


He 


Arg 


Ser 


216 


305 










310 










315 










320 


218 


Gly 


Val 


Lys 


Arg 


Leu 


Ser 


Lys 


Ser 


Arg 


Gin 


Gly 


Ser 


Thr 


Gin 


Gly 


Arg 


219 










325 










330 










335 




221 


Leu 


Asp 


Asp 


Phe 


Phe 


Lys 


Val 


Thr 


Gly 


Ser 


Leu 


Ser 


Ser 


Ala 


Lys 


Arg 


222 








340 










345 










350 






224 


Lys 


Glu 


Pro 


Glu 


Pro 


Lys 


Gly 


Ser 


Ala 


Lys 


Lys 


Lys 


Ala 


Lys 


Thr 


Gly 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/586 , 744B 



DATE: 02/01/2002 
TIME: 08:56:52 



Input Set : A:\9584-017.txt 

Output Set: N:\CRF3\02012002\I586744B.raw 



225 355 360 

227 Gly Ala Gly Lys Phe Arg Arg Gly 

228 370 375 

230 <210> SEQ ID NO: 4 

231 <211> LENGTH: 1930 

232 <212> TYPE: DNA 

C--> 233 <213> ORGANISM: Artificial 

235 <220> FEATURE: \ 

236 <223> OTHER INFORMATION/^cDN^^ 

238 <400> SEQUENCE: 4 

239 atgggaattc acggccttgc caaactaatt 
241 aatgacatca agagctactt tggtcgtaaa 

. 24 3 cagttcctga ttgctgttcg tcagggtggg 

245 accagcctga tgggcatgtt atggcaaacc 

247 tacgtctttg atggcaaacc accacagctg 

249 aggcgcgccg aggctgagaa gcaactgcag 

251 gtggagaagt tcaccaagag gctcgtgaag 

253 cacctcgtga gcctcatggg catcccttac 

255 tgtgctgccc tggcaaaggc tggcaaagtc 

257 ctcacttttg gcagccccgt gctaatgcga 

259 cccatccaag agttccatct gagccgcgtc 

261 tttgtggatc tgtgcatcct gctgggtagc 

263 gccaagcggg ctgtggatct catccagaaa 

265 ctggacccca gcaagtaccc cgttccagag 

267 ttcctggagc cagaagtagt ggacccagag 

269 gaagaagagt tggtcaaatt tatgtgtggt 

271 agtggggtca agcggctgag taagagccgc 

273 ttcttcaagg tgacaggctc actctcctca 

275 cctgctaaga agaaagcaaa gactggggga 

277 tgtccttccc ctccactgtc cttgacccca 

279 cagcacatcc ctcttgtccc tcgtcttgag 

281 agagctttcc ctctcttgac cctgtggcag 

283 tagctcagga aagatgtcag gctcaaacca 

285 gttctgtgca actgcgagca atgtcttaag 

287 ggagatagtt tcccagctgg ccagctggtg 

289 ctgctccttc taatttcact gtccctgaaa 

291 aagaactgca aagagacgca gcagagagaa 

293 agctgatttt tgtgggcaga aatttgaact 

295 ggagtgagat ggcagtgttt aagttttgat 

297 tccagtgttg gggattgacc ccagggcaaa 

299 caagtgatgt tctgacagcc tttctgaggc 

301 aactgttgtt cataggctat ttctatttta 

303 aattataaaa 

306 <210> SEQ ID NO: 5 

307 <211> LENGTH: 382 

308 <212> TYPE: PRT 

C--> 309 <213> ORGANISM: Artificial 

311 <220> FEATURE: 

312 <223> OTHER INFORMATION: peptide 



365 



Lys 



fee 



( 



gctgatgtgg 
gtggccatcg 
gatgtgctgc 
ateegcatgg 
aagtcaggcg 
caggctcagg 
gtcaccaagc 
ettgatgeae 
tatgetgegg 
cacttaactg 
ctgeaggage 
gaetactgeg 
cataagagca 
aactggctcc 
tctgtggagc 
gaaaagcagt 
cagggcagca 
getaagegea 
gcggggaagt 
ggctgtctat 
gagagttcat 
gaaggccgta 
cttctcaggt 
gaagaagaag 
gaggagaggt 
gatgcccatc 
gtctggctga 
tgctgcctgc 
ttgtagtttt 
ggcattaagt 
aatcaattga 
aaagatgtga 



cccccagtgc 
atgcctccat 
agaacgagga 
agaatggcat 
agctggccaa 
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aacacaatga 
ccagcgaggc 
ecaeggagga 
ccagtgaggc 
tgggtctgaa 
agagcatccg 
tcgaggagat 
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cccagggacg 
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tgcttccagc 
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tgtagtcatt 
gagegagget 
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tegctgatgg 
agtactgacc 
agttgtgcag 
ctctcccctc 
cctgtgcctc 
tgggagaaga 
aaaacaataa 
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235 
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Glu 
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He 
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Lys 
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290 
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300 
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Lys 
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He 
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Lys 
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He 
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Lys 
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Lys 


377 
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Gly 


He 
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Leu 
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Pro 


Lys 


380 
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Lys 
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Gin 
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Ala 


Gin 
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Asn 


Lys 
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360 
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Lys 
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